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ABSTRACT 


The  necessary  elements  of  the  calculus  for  factorials  as 
developed  by'Kurkjian  and  Zelen  are  described,  modified  where  necessary, 
and  applied  to  the  analysis  of  unbalanced  n-way  classifications  with 
fixed  effects.  Estimators  for  all  main  effect  and  interaction  effects 
parameters  are  obtained  along  with  the  associated  variances.  The  sums 
of  squares  for  each  effect  eliminating  all  other  effects  is  presented  in  a 
form  suitable  for  direct  computation.  This  form  results  in  considerable 
computational  saving  over  the  method  of  fitting  constants  used  in  general 
regression  theory.  The  results  are  applied  to  the  particular  case  of 


proportional  frequencies  in  the  subclasses. 


APPLICATIONS  OF  THE  FACTORIAL  CALCULUS  TO  GENERAL 
UNEQUAL  NUMBERS  ANALYSES^ 

W.  T.  Federer^’and  M.  ZelerA3 


1.  Introduction 

This  paper  is  the  second  in  a  series  of  papers  which  applies  the  calculus 

© 

for  factorial  arrangements  developed  by  Kurkjian  and  Zelen  [1962]  to  various 
problems  in  the  analysis  of  experiment  designs.  The  first  paper  dealing  with 
applications  [1963]  was  devoted  to  the  analysis  of  block  and  direct  product 
designs.  The  main  object  of  this  paper  is  to  apply  this  special  calculus  to  an 
alternate  way  of  treating  the  analysis  of  variance  with  unequal  numbers.  The 
usual  way  this  is  done  is  by  the  method  of  fitting  constants;  cf  Federer  [1957], 
Yates  [1934].  The  use  of  the  special  methods  developed  here  leads  to  substantial 
computational  savings  over  the  method  of  fitting  constants. 

Section  two  of  this  paper  contains  the  necessary  parts  of  the  factorial  calculus 
which  is  the  starting  point  of  our  investigation.  Section  three  develops  the 
general  theory  for  unequal  numbers  and  section  four  shows  the  resulting 
simplification  when  the  frequencies  are  proportional. 

^  Sponsored  by  the  Mathematics  Research  Center,  United  States  Army,  Madison, 
Wisconsin  under  Contract  No. :  DA~11~022~ORD~2059. 
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2.  Elements  of  the  Factorial  Calculus 

The  notation  and  special  operations  used  in  this  paper  will  be  a  modified 

version  of  the  calculus  for  factorial  arrangements  introduced  in  Kurkjian  and 

Zelen  [1962].  The  modifications  are  straightforward  generalizations  which  are 

useful  in  treating  the  case  of  unequal  numbers* 

Consider  a  factorial  experiment  with  the  n  factors  {Ag}  such  that  factor 

A  has  m  levels  for  s  =  1,  2,  ,  n  .  The  i^  treatment  combination 

consists  of  the  n-^uple  i  =  (i^,  i^,  . ..,  i  ^  where  ig  denotes  a  particular 

n 

level  from  factor  A  .  The  number  of  treatment  combinations  is  v  =  H  m 
s  s-1  s 

Let  Y  denote  a  v  x  1  random  vector  following  a  multivariate  normal  distribution 
with 

(2.1)  E(Y)  a  l|x  +  t 

(2.2)  V(Y)  =  cr2  N_1  . 

The  quantity  1  is  a  vx  1  vector  having  unity  elements ,  \x  is  a  scalar,  t  is 

a  v  x  1  vector  of  (fixed)  treatment  effect;  and  the  matrix  N  is  a  v  x  v  diagonal 

matrix  having  only  non-zero  diagonal  elements  n.  which  denote  the  number  of 

th 

observations  on  the  i  treatment.  The  elements  of  t  are  not  linearly 
independent,  but  satisfy  a  single  linear  restraint  which  will  be  described  later. 

Also  define  {ag},  s  -  1,  2,  . . .  ,  n,  to  denote  vectors,  termed  primitive 
elements,  such  that 


as  ^  ast2^  •  •  •  9  as(ms)]  • 
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New  elements  can  be  formed  with  the  operation  of  the  symbolic  direct  product 

(SDP)  which  is  denoted  by  ®  .  The  SDP  between  a  and  a  is  defined 

P 

to  be 


[ap  ®  aq]  -[a^Cl.l),  apqd,2),  •••,  ^1,  , 


(2.3) 


a  (2,1).  ....  a  (2,m  }.  ....  a.  (m  .1).  ....  a  (m  ,m  }] 
pqv  J  19  7  pqv  '  q/?  PQ  P  pq  P  q 


Note  that  the  subscript  refers  to  the  primitive  elements  involved  and  the  argument 

is  a  vector  of  two  elements  which  are  ordered  lexographically.  The  lexographical 

order  is  to  hold  the  first  element  of  the  argument  fixed  at  1  and  run  through  the 

levels  1,  2,  . . . ,  of  A^;  then  change  the  first  argument  to  level  2  of 

and  run  through  the  levels  of  A^;  etc.  The  SDP  is  also  defined  for  more  than 

two  primitive  elements  in  the  same  way;  i.  e. ,  a^  @  a^  0  a^,  etc.  The 

elements  of  a^  0  a^  denote  the  vector  whose  elements  are  the  parameters 

associated  with  the  two  factor  interaction  between  factors  A  and  A  ;  the 

P  q 

elements  of  a  0  a  0  a  denote  the  vector  associated  with  the  three  factor 

p  w  q  r 

interactions  among  factors  A  ,  A  ,  and  A  ,  etc.' 

p’  q7  r 9 

Let  x  be  a  variable  which  takes  on  the  values  0  or  1.  We  define 
s 


x 

s 

a 

s 


and  use  the  convention  that  a  0  1  =  a 

s  ^  s 


for  x  =  1 
s 


for  x  =  0 
s 


.  Then,  if  x  =  (x^?  x^, 
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generalized  interaction  may  be  denoted  by 

x.  x„  x 

x  1  2  ^  n 

a  =  ai  ®  a2  ®  •  ’  *  ®  an 


x  i 

and  will  have  m  =  II  m.  components. 
i=i  1 

The  model  relating  the  treatment  effects  to  the  interaction  parameters  can 
be  written  by  defining 


x 

I  S=  < 
s 


I  for  x  =1 
s  s 


1  for  x  =  0  ; 
s  s 


X  X  X 

ix  =  i. 1  xi.2  x  ...  xi  n  . 

.12  n  ? 


where  I  is  m  xm  identity  matrix  and  1  is  an  m  x  1  column  vector  having 
s  s  s  s  s 

all  elements  equal  to  unity.  Then  we  can  write 


(2.4)  t=2'lXaX, 

X 

where  x  =  (x. .  x  ...  x  )  and  the  summation  Y,  refers  to  all  the  2n  -  1 
x  2  n  ^ 

x 

n-digit  binary  numbers  x  excluding  x  =  (0?  0,  . . .  ,  0}  .  The  components  of  t 

are  taken  in  the  same  lexographical  ordering  as  the  n-factor  interaction. 

x 

The  interaction  parameters  a  are  not  linearly  independent.  Let  ug  be  a 
lxn  vector  with  all  elements  equal  to  zero  except  for  the  sth.  element  which  is 
equal  to  unity  and  define  r  by 
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vr  =  (1  X  1  X  . ..  X  1  )  N  (1  X  1  X  . . .  X  1  ) 
1  n  i  l  n 


We  then  define  the  x  mg  diagonal  matrix 


(2.5) 


m  u  ,  u 

w  =~{I  S)  N (I  s) 

s  rv 


s  =  1 ,  2 ,  . . .  ,  n 


The  elements  of  are  proportional  to  the  number  of  times  the  various  levels 

of  factor  appear. 

Then  a  convenient  set  of  restraints  among  the  parameters  may  be  taken  to 


1  W  a  =  0  p  --  1  '2,  . . .  .  n; 

p  P  P  ;  *  ,  •  * . 


1  XI  . 
p  ■■  q 


i  x  l 
p  q 


[Wp  X  Wq]  [ap  @  aq]  =  o,  p#  q  =  1,  2,  . . . ,  n  ; 


(2.6). 


i,  x  i;  x . ..  xi 

12  n 


L  XL’X...  XI 
12  n 


I,  X  I,  X  ...  X  1 
12  n 


[W1  X  W2  X  . . .  XW][al®a2®...@aJ 


With  these  restraints,  'it  can  easily  be  shown  that 
(2.7)  .  .  .  [lj  X  12-X  ..  .  X  lV|  [Wj  X  W2  X 


X  W  It  =  0 
nJ 


-O 
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We  shall  find  it  convenient  to  use  the  following  notational  convention. 

Let  Z  (s  =  1,  2,  . . . ,  ri)  be  matrices  and  x  -  (x^,  x^;  . . . ,  x^)  be  an  n-digit 


x 


binary  number.  Then,  we  shall  always  write  Z  and  Z(x)  to  denote 


(2.8) 


X  X  X 

ZX=Z,  XZ,  X...  xz  n 
12  n 


Z  (x)  =  Z  XZ  X  i  .  .  X  Z  r  ,  , 

v  1  sn  s  for  those  x  =x  =...=x  =1 

12  p  s,  s0  s  ■ 

^  1  2  p 


When  the  Zg  are  scalar  quantities  (say)  then 


(2.9) 


X 


z  =  z(x)  . 


3.  ■  The  General  Theory  of  Unequal  Numbers 

In  this  section  we.  shall  develop  the  general  theory  for  unequal  numbers.  Our 
procedure  will  be  to  first  find  estimates  for  the  various  interaction  parameters  and 
their  "variances  and  then  derive  the  associated  sum  of  squares  for  use  in  the 
analysis  of  variance.  The  estimation  of  the  various  interactions  constitutes  no  real 
problem;  the  difficult  problem  is  to  determine  the  appropriate  sums  of  squares.  . 

Let  W  =  W.  X  W_  X  ...  X  W  ;  then  it  can  easily  be  shown  that  the  estimable 
functions  for  t.  may  be  estimated  from 


(3.1) 


t  =  [I  - 


TW 

1' Wl 


]Y=  [I 


M]y 

T  T  J 


I 

where  1  is  a  v  x  1  column  vector  having  all  elements  unity ;  J  =  1  1  ■ .  and  I 
is  the  v  X  v  identity  matrix. 
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Consequently  we  have 
(3.2) 


?  TW  •  JW  » 

var  t  =  o-  [I  -  ]  N  1  [I  -  ] 

.  -V  L  v 


Define  the  mx  m  matrix  M  by 
s  s  s .  • 


(3.3) 


M.  =  [m  I  J  W  ]W-1 
s  ss  s  s  s 


Also  let 


(3.4) 


.  X 

s 


M  for  x  ,=  1 
S'-  s 


for  x  =0 


■.  ■  x  X1  xz  xn 

'  M  =  M.  X  M0  X  .  . .  X  M 
1  2  n 


x 


We  note  that.  M  may;  also  be  written  as 


■  MX=M(x)(IX)' 


where  M(x)  =  M  X  M  X  ...  X  M  and  x  =  x  =  . .  .  x  =1 
s,  s0  s  s.  s0  s 

1  Z  p  12  p 


with  the  remaining  x  =  0;  i.  e.  M(x)  is  the  direct  product  of  those  M 
s  s 

which  x  =  1  .  We  also  record  for  reference 
s 


(3.5) 


MW  =  m  I  -  J  W 
s  s  s  s  s  s 


/  M  W  a  =  m  a 
S  sss  s  s 


for 


M  W  1  =  O 

s  S  .s 
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After  some  algebra  one  can  show  that 


(3.6) 

and 


a  X  =  MX  W  t/v 


(3.7) 


var  aX  =  [MXWN_1  W(MX)  ]  o-2/v2  . 


.X 


Note  that  the  var  a  may  be  written  as 


var  a  X  =  M  (x)  [(f)'  W  N_1  W IX]  M  (x)  o-  2/v2  . 


The  variance  can  be  further  simplified.  For  this  purpose  define 


W 

s 


W  if  x  =  1 
s  s 


I  if  x  =  0 
s  s 


n  x 

wx  =  n  xw  s 

i  s 

S=1 


x 

Then  the  quantity  [(I  )  W]  may  be  written 


i  n 


x 


x  .  1— x 

T  A  r  S 


TX'  ,>x 


(i  }  w  =  n  x  (i  °)  w  =  w(x}  n  x  (i  )  w  =  w(x)  (i  }  w 


s=l 


s=l 


X  *  — *1  2^ 

The  diagonal  matrix  [(I  }  W  N  .  WI  ]  can  also  be  written 


(I*)’  WN_1WIX  =  W(x)  R(x)  W(x) 


where 


(3.8) 


R(x)  =  (ix)'  w1_x  n-1  w1_x  IX 
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«**  ^ 

Hence  the  variance  of  a  is 

(3.9)  var  aX  =  [M(x}  W(x)R(x}  W(x)  M(x)]cr^/v2  • 

Note  that  R(x)  and  W(x)  are  diagonal  matrices. 

It  remains  to  find  the  sum  of  squares  associated  with  a  .  We  shall 
find  this  making  use  of  a  result  on  quadratic  forms  recently  given  by  Rao  [1962J 

Lemma.  Let  X  =  (X  ,  . . .  9  X^)  have  a  singular  multivariate  normal 

distribution  with  E(X)  =  0  and  var  X  =  cr^  Yj  w^ere  the  rank  of  Yj  is  f  (f  <  n)  . 

2 

Then  a  necessary  and  sufficient  condition  for  the  quadratic  form  X'SX/tr  to 
have  a  chi-square  distribution  with  f  degrees  of  freedom  is  that  S  be  a  real 
symmetric  n  x  n  matrix  having  the  properties  that 

(i)  s  =  s  £s- 

c±i)  Z=  Zs  Z  • 

We  now  turn  our  attention  to  finding  the  matrix  S  of  the  above  lemma  when 

2=  M(x)  W(x)  R(x)  W(x)  M(x)  .  We  point  out  that  the  degrees  of  freedom 
***  x  ^ 

associated  with  a  is  f(x)  =  n(m-.-l)  (rank  of  var  a  )  .  Hence  there  will 

s=l  1 

exist  at  least  r(x)  =  m(x)  -  f(x)  linearly  independent  non-estimable  functions 
x 

of  a  .  These  non-estimable  functions  will  be  denoted  by 

(3.10)  K'(x)W(x)aX 

where  K'(x)  is  r(x)xm(x),  has  rank  r(x)  ,  and  W(x)  denotes  the  direct 

product  of  those  W  for  which  x  =  1  . 

s  s 


-10- 
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Define  the  m(x)xm(x)  matrix  V(x)  by 


(3.11) 


w~  • 

W(x)  M(x)  R  \x)  K(x) 

-1 

V(x) 

K(x)  U(x) 

K'  (x)  R-1(x)  O 

U(x)  K'(x) 

O 

where 


U(x)  =  [K'(x)  R~\x)  K(x)]'1 


(3.12) 

It  can  be  verified  that  the  matrix  S  of  the  lemma  is 

(3.13) 

Therefore,  the  required  sum  of  squares  is 

(3.14)  v2(aX)'  (V (x)  R~\x)  V(x)}(£X) 


S  =  v2  V  (x)  R  \x)  V  (x) 


The  above  sum  of  squares  still  requires  knowledge  of  the  matrix  V(x)  which 
is  not  known  explicitely.  Using  (3.  5),  we  can  write 


m  a  =  M  W  a 
s  s  s  s  s 


and  therefore 
(3. 15] 

Substituting  (3.15)  in  (3.14)  results  in 


m(x)  aX  =  M(x)  W(x)  aX 


^m(xp2  ^  "W (x)  M (x)  V (x)  R~\x)  V(x)  M (x)  W (x)  ^  (a  X) 


)j(a: 
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Using  the  relations  of  a  matrix  to  its  inverse  gives 

W (x)  M  (x)  V (x)  +  R_1(x)  K(x)  U(x)  K'(x)  =1  , 

K'(x)  R_1(x)  V(x)  =  O  . 

Consequently, 

W(x)  M(x)  V(x)  R  \x)  V(x)  M(x)  W(x}  =  R  \x)  [I  -  K(x)  U(x)  K‘(x)  R  \x)]  , 

which  follows  from  the  fact  that  the  matrix  in  square  brackets  in  idempotent. 
Therefore  the  sum  of  squares  can  be  written  as 

(3.16)  (^)2(aX)'  |r_1(x)  [I  -  K(x)  U (x)  K'(x)  R_1(x)]  j  (aX)  . 

Since  R(x)  is  a  diagonal  matrix,  the  main  computational  labor  is  in  computing 
the  r(x)  xr(x)  inverse  matrix 

U(x)  =  [K'(x)  R_1(x)  K(x)]_1  . 


The  sums  of  squares  for  main  effects  may  be  written  explicitely  as  r(x)  =  1 
In  this  case  for  (say)  a 


K*(x)=ls  ,  W(x)  =  Wg 


Rs  =  R(x) 


=  (1  W.XLW,X...XI  X...X1  W  )  N  (W.l.  XW.l  X.  .  .  XI  X,  .  •  X  W  1  ) 
v  1  2  2  s  nn/v1122  s  nn' 


1  -1  -1 
U  -  (1  R  1  ) 
s  s  s  s 
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The  sum  of  squares  for  the  main  effect  associated  with  is  thus 


(3.17) 


(—  )2  S'  {R  1  [I 
vm  1  s  s  L  s 
s 


J  tt¬ 
ys _ 

U 


]) 


a 

s 


4,  Proportional  Frequencies 

As  is  well  known,  the  case  of  proportional  frequencies  turns  out  to  be 
particularly  simple.  The  case  of  proportional  frequencies  arises  when 

n 


N  =  N  X  N2  X 


X  N  =  n  X  N 
n  t  s 


where  the 

Define  n 

s 


N  are  such  N  is  the  direct  product  of  all  N 
s  s 


=  1  N  1  ;  then  the  W  quantities  are 
s  s  s  s 


m 

W  =  =r^  N 
s  n  s 
s 


and 


n 

W  =  n  X  W 

T  S 

S=1 


Consequently  we  have 

1  -l  ,Tx. 1  x 

W(x)R(x}W(x]  =(IX)  WN  WIX  =  =  -f—  N(x) 

r  rn(x) 


x 


-  x  2  X1  X1  '  X2  X2  '  x 

vara  [M  N  (M  )  X  M  N  (M  )  X...XM  nN  (M  n)  1 

*  111  2  22'  nnvn'J 


and 
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Note  that 


VN.‘Ms‘>'=< 


n  M 
s  s 


for  x  =  1 
s 


n  for  x  =  0 
s  s 


Therefore  the  variance  can  be  written  as 

2 

(4.1)  var  a  X  =  ~  M(x) 

-  x 

as  n  =  rv  . 

The  sum  of  squares  associated  with  a  can  easily  be  written  by  noting 
that  the  second  term  in  (3,16)  is  a  null  matrix.  This  can  be  demonstrated 
by  writing 

(4.2)  R_1(x)  =  -N(x) 

v  n(x) 

and  thus 

K’(x)  R~\x)  a  X  K'  {x)  N(x)  aX  =  O  . 

v  n(x) 

X 

The  sum  of  squares  associated  with  a  is  then 

Hbr)2  dx)‘  r“Wux)  =  dx)‘  n(x)  (Sx)  . 

v  1  n(x) 


(4.3) 
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